Huddle Based Harmonic K Means Clustering Using Iterative Relocation on Water Treatment Plant Data Attributes
نویسندگان
چکیده
Clustering is one of the important methods in data mining to weight the distance between the cluster objects effectively. Existing K-means algorithm with Affinity Propagation (AP) algorithm captures the structural information of texts, and improve the semi supervised clustering process. Seeds Affinity Propagation (SAP) provides the detailed distance measurement but it takes the longer execution time to perform the clustering process. Weighted Clustering Ensemble (WCE) algorithm on the existing work provides an effectual technique on the temporal data clustering. WCE are not effective in developing the speed clustering on the data mining system. To reduce the execution time on clustering of the large capacity of dataset, Huddle based Harmonic K-means Clustering (HH K-means Clustering) mechanism is proposed in this paper. HH Kmeans Clustering first performs the initial screening process to divide them into a cluster and non-cluster data form. Secondly, HH K-means Clustering develops the Iterative Relocation (IR) technique to improve the speed of the clustering process. Iterative Relocation (IR) technique calculate and estimates the cluster center objects (i.e.,) Centroids in HH K-means Clustering. The IR technique performs the initialization, relocation of objects and cluster updating process to reduce the computation of distance measure on the every data objects. Huddle groups the data objects using the Harmonic K-means Clustering and calculate the clustering time and clustering rate. So, HH K-means clustering add some improved conditions and factors to enhance the effectiveness of data mining clustering system with approximately 7 % improved cluster rate. Experiment is conducted using Water Treatment Plant Data Set on the factors such as execution time, huddle based clustering rate and entropy level.
منابع مشابه
Persistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm
Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...
متن کاملGROUND MOTION CLUSTERING BY A HYBRID K-MEANS AND COLLIDING BODIES OPTIMIZATION
Stochastic nature of earthquake has raised a challenge for engineers to choose which record for their analyses. Clustering is offered as a solution for such a data mining problem to automatically distinguish between ground motion records based on similarities in the corresponding seismic attributes. The present work formulates an optimization problem to seek for the best clustering measures. In...
متن کاملMASTER'S SEMINAR ANNOUNCEMENT Department of Statistics Model-Based Clustering Via the Bayesian Information Criterion with Applications and Comparison with K-means Algorithm
Data mining aims to discover unknown knowledge in data sets. An important step in data mining is the discovery of structural features in a data set without using prior knowledge. Here we consider the situation in which the training data is not available, known as the clustering analysis. One difficult question we are often faced with in clustering analysis is how to choose the number of cluster...
متن کاملCUSTOMER CLUSTERING BASED ON FACTORS OF CUSTOMER LIFETIME VALUE WITH DATA MINING TECHNIQUE
Organizations have used Customer Lifetime Value (CLV) as an appropriate pattern to classify their customers. Data mining techniques have enabled organizations to analyze their customers’ behaviors more quantitatively. This research has been carried out to cluster customers based on factors of CLV model including length, recency, frequency, and monetary (LRFM) through data mining. Based on LRFM,...
متن کاملA Hybrid Data Clustering Algorithm Using Modified Krill Herd Algorithm and K-MEANS
Data clustering is the process of partitioning a set of data objects into meaning clusters or groups. Due to the vast usage of clustering algorithms in many fields, a lot of research is still going on to find the best and efficient clustering algorithm. K-means is simple and easy to implement, but it suffers from initialization of cluster center and hence trapped in local optimum. In this paper...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014